Dataset statistics
| Number of variables | 22 |
|---|---|
| Number of observations | 41176 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 6.9 MiB |
| Average record size in memory | 176.0 B |
Variable types
| Numeric | 11 |
|---|---|
| Categorical | 10 |
| Boolean | 1 |
df_index is highly correlated with emp.var.rate and 3 other fields | High correlation |
pdays is highly correlated with previous | High correlation |
previous is highly correlated with pdays | High correlation |
emp.var.rate is highly correlated with df_index and 3 other fields | High correlation |
cons.price.idx is highly correlated with df_index and 1 other fields | High correlation |
euribor3m is highly correlated with df_index and 2 other fields | High correlation |
nr.employed is highly correlated with df_index and 2 other fields | High correlation |
df_index is highly correlated with emp.var.rate and 3 other fields | High correlation |
pdays is highly correlated with previous | High correlation |
previous is highly correlated with pdays and 1 other fields | High correlation |
emp.var.rate is highly correlated with df_index and 3 other fields | High correlation |
cons.price.idx is highly correlated with df_index and 3 other fields | High correlation |
euribor3m is highly correlated with df_index and 3 other fields | High correlation |
nr.employed is highly correlated with df_index and 4 other fields | High correlation |
df_index is highly correlated with cons.price.idx | High correlation |
pdays is highly correlated with previous | High correlation |
previous is highly correlated with pdays | High correlation |
emp.var.rate is highly correlated with cons.price.idx and 2 other fields | High correlation |
cons.price.idx is highly correlated with df_index and 1 other fields | High correlation |
euribor3m is highly correlated with emp.var.rate and 1 other fields | High correlation |
nr.employed is highly correlated with emp.var.rate and 1 other fields | High correlation |
housing is highly correlated with loan | High correlation |
loan is highly correlated with housing | High correlation |
month is highly correlated with contact | High correlation |
contact is highly correlated with month | High correlation |
df_index is highly correlated with contact and 9 other fields | High correlation |
age is highly correlated with job | High correlation |
job is highly correlated with age and 1 other fields | High correlation |
education is highly correlated with job | High correlation |
housing is highly correlated with loan | High correlation |
loan is highly correlated with housing | High correlation |
contact is highly correlated with df_index and 6 other fields | High correlation |
month is highly correlated with df_index and 6 other fields | High correlation |
pdays is highly correlated with df_index and 5 other fields | High correlation |
previous is highly correlated with pdays and 2 other fields | High correlation |
poutcome is highly correlated with df_index and 7 other fields | High correlation |
emp.var.rate is highly correlated with df_index and 7 other fields | High correlation |
cons.price.idx is highly correlated with df_index and 7 other fields | High correlation |
cons.conf.idx is highly correlated with df_index and 9 other fields | High correlation |
euribor3m is highly correlated with df_index and 10 other fields | High correlation |
nr.employed is highly correlated with df_index and 9 other fields | High correlation |
output is highly correlated with df_index and 3 other fields | High correlation |
df_index is uniformly distributed | Uniform |
df_index has unique values | Unique |
previous has 35551 (86.3%) zeros | Zeros |
Reproduction
| Analysis started | 2022-06-20 13:41:59.527402 |
|---|---|
| Analysis finished | 2022-06-20 13:42:34.036175 |
| Duration | 34.51 seconds |
| Software version | pandas-profiling v3.2.0 |
| Download configuration | config.json |
df_index
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONUNIFORMUNIQUE| Distinct | 41176 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 20593.05673 |
| Minimum | 0 |
|---|---|
| Maximum | 41187 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 321.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2059.75 |
| Q1 | 10294.75 |
| median | 20594.5 |
| Q3 | 30890.25 |
| 95-th percentile | 39128.25 |
| Maximum | 41187 |
| Range | 41187 |
| Interquartile range (IQR) | 20595.5 |
Descriptive statistics
| Standard deviation | 11890.49312 |
|---|---|
| Coefficient of variation (CV) | 0.5774030187 |
| Kurtosis | -1.200129397 |
| Mean | 20593.05673 |
| Median Absolute Deviation (MAD) | 10298 |
| Skewness | 5.289244425 × 10-5 |
| Sum | 847939704 |
| Variance | 141383826.7 |
| Monotonicity | Strictly increasing |
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 27512 | 1 | < 0.1% |
| 27454 | 1 | < 0.1% |
| 27455 | 1 | < 0.1% |
| 27456 | 1 | < 0.1% |
| 27457 | 1 | < 0.1% |
| 27458 | 1 | < 0.1% |
| 27459 | 1 | < 0.1% |
| 27460 | 1 | < 0.1% |
| 27461 | 1 | < 0.1% |
| Other values (41166) | 41166 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 |
| Value | Count | Frequency (%) |
| 41187 | 1 | |
| 41186 | 1 | |
| 41185 | 1 | |
| 41184 | 1 | |
| 41183 | 1 | |
| 41182 | 1 | |
| 41181 | 1 | |
| 41180 | 1 | |
| 41179 | 1 | |
| 41178 | 1 |
| Distinct | 78 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 40.02380027 |
| Minimum | 17 |
|---|---|
| Maximum | 98 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 321.8 KiB |
Quantile statistics
| Minimum | 17 |
|---|---|
| 5-th percentile | 26 |
| Q1 | 32 |
| median | 38 |
| Q3 | 47 |
| 95-th percentile | 58 |
| Maximum | 98 |
| Range | 81 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 10.42067987 |
|---|---|
| Coefficient of variation (CV) | 0.2603620795 |
| Kurtosis | 0.7911133226 |
| Mean | 40.02380027 |
| Median Absolute Deviation (MAD) | 7 |
| Skewness | 0.7845602604 |
| Sum | 1648020 |
| Variance | 108.5905689 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 31 | 1947 | 4.7% |
| 32 | 1845 | 4.5% |
| 33 | 1833 | 4.5% |
| 36 | 1779 | 4.3% |
| 35 | 1758 | 4.3% |
| 34 | 1745 | 4.2% |
| 30 | 1714 | 4.2% |
| 37 | 1475 | 3.6% |
| 29 | 1453 | 3.5% |
| 39 | 1430 | 3.5% |
| Other values (68) | 24197 |
| Value | Count | Frequency (%) |
| 17 | 5 | < 0.1% |
| 18 | 28 | 0.1% |
| 19 | 42 | 0.1% |
| 20 | 65 | 0.2% |
| 21 | 102 | 0.2% |
| 22 | 137 | 0.3% |
| 23 | 226 | 0.5% |
| 24 | 462 | |
| 25 | 598 | |
| 26 | 698 |
| Value | Count | Frequency (%) |
| 98 | 2 | < 0.1% |
| 95 | 1 | < 0.1% |
| 94 | 1 | < 0.1% |
| 92 | 4 | < 0.1% |
| 91 | 2 | < 0.1% |
| 89 | 2 | < 0.1% |
| 88 | 22 | |
| 87 | 1 | < 0.1% |
| 86 | 8 | < 0.1% |
| 85 | 15 |
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 321.8 KiB |
| admin | |
|---|---|
| blue-collar | |
| technician | |
| services | |
| management | |
| Other values (7) |
Length
| Max length | 13 |
|---|---|
| Median length | 12 |
| Mean length | 8.702399456 |
| Min length | 5 |
Characters and Unicode
| Total characters | 358330 |
|---|---|
| Distinct characters | 23 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | housemaid |
|---|---|
| 2nd row | services |
| 3rd row | services |
| 4th row | admin |
| 5th row | services |
Common Values
| Value | Count | Frequency (%) |
| admin | 10419 | |
| blue-collar | 9253 | |
| technician | 6739 | |
| services | 3967 | 9.6% |
| management | 2924 | 7.1% |
| retired | 1718 | 4.2% |
| entrepreneur | 1456 | 3.5% |
| self-employed | 1421 | 3.5% |
| housemaid | 1060 | 2.6% |
| unemployed | 1014 | 2.5% |
| Other values (2) | 1205 | 2.9% |
Length
| Value | Count | Frequency (%) |
| admin | 10419 | |
| blue-collar | 9253 | |
| technician | 6739 | |
| services | 3967 | 9.6% |
| management | 2924 | 7.1% |
| retired | 1718 | 4.2% |
| entrepreneur | 1456 | 3.5% |
| self-employed | 1421 | 3.5% |
| housemaid | 1060 | 2.6% |
| unemployed | 1014 | 2.5% |
| Other values (2) | 1205 | 2.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 47260 | |
| n | 35536 | |
| a | 33319 | |
| l | 31615 | 8.8% |
| i | 30642 | 8.6% |
| c | 26698 | 7.5% |
| r | 21024 | 5.9% |
| m | 19762 | 5.5% |
| d | 16507 | 4.6% |
| t | 14587 | 4.1% |
| Other values (13) | 81380 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 347656 | |
| Dash Punctuation | 10674 | 3.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 47260 | |
| n | 35536 | |
| a | 33319 | |
| l | 31615 | |
| i | 30642 | |
| c | 26698 | 7.7% |
| r | 21024 | 6.0% |
| m | 19762 | 5.7% |
| d | 16507 | 4.7% |
| t | 14587 | 4.2% |
| Other values (12) | 70706 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 10674 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 347656 | |
| Common | 10674 | 3.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 47260 | |
| n | 35536 | |
| a | 33319 | |
| l | 31615 | |
| i | 30642 | |
| c | 26698 | 7.7% |
| r | 21024 | 6.0% |
| m | 19762 | 5.7% |
| d | 16507 | 4.7% |
| t | 14587 | 4.2% |
| Other values (12) | 70706 |
Common
| Value | Count | Frequency (%) |
| - | 10674 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 358330 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 47260 | |
| n | 35536 | |
| a | 33319 | |
| l | 31615 | 8.8% |
| i | 30642 | 8.6% |
| c | 26698 | 7.5% |
| r | 21024 | 5.9% |
| m | 19762 | 5.5% |
| d | 16507 | 4.6% |
| t | 14587 | 4.1% |
| Other values (13) | 81380 |
marital
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 321.8 KiB |
| married | |
|---|---|
| single | |
| divorced | |
| unknown | 80 |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 6.831139499 |
| Min length | 6 |
Characters and Unicode
| Total characters | 281279 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | married |
|---|---|
| 2nd row | married |
| 3rd row | married |
| 4th row | married |
| 5th row | married |
Common Values
| Value | Count | Frequency (%) |
| married | 24921 | |
| single | 11564 | |
| divorced | 4611 | 11.2% |
| unknown | 80 | 0.2% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| married | 24921 | |
| single | 11564 | |
| divorced | 4611 | 11.2% |
| unknown | 80 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 54453 | |
| i | 41096 | |
| e | 41096 | |
| d | 34143 | |
| m | 24921 | |
| a | 24921 | |
| n | 11804 | 4.2% |
| s | 11564 | 4.1% |
| g | 11564 | 4.1% |
| l | 11564 | 4.1% |
| Other values (6) | 14153 | 5.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 281279 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 54453 | |
| i | 41096 | |
| e | 41096 | |
| d | 34143 | |
| m | 24921 | |
| a | 24921 | |
| n | 11804 | 4.2% |
| s | 11564 | 4.1% |
| g | 11564 | 4.1% |
| l | 11564 | 4.1% |
| Other values (6) | 14153 | 5.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 281279 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 54453 | |
| i | 41096 | |
| e | 41096 | |
| d | 34143 | |
| m | 24921 | |
| a | 24921 | |
| n | 11804 | 4.2% |
| s | 11564 | 4.1% |
| g | 11564 | 4.1% |
| l | 11564 | 4.1% |
| Other values (6) | 14153 | 5.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 281279 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 54453 | |
| i | 41096 | |
| e | 41096 | |
| d | 34143 | |
| m | 24921 | |
| a | 24921 | |
| n | 11804 | 4.2% |
| s | 11564 | 4.1% |
| g | 11564 | 4.1% |
| l | 11564 | 4.1% |
| Other values (6) | 14153 | 5.0% |
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 321.8 KiB |
| university.degree | |
|---|---|
| high.school | |
| basic.9y | |
| professional.course | |
| basic.4y | |
| Other values (3) |
Length
| Max length | 19 |
|---|---|
| Median length | 17 |
| Mean length | 12.71046241 |
| Min length | 7 |
Characters and Unicode
| Total characters | 523366 |
|---|---|
| Distinct characters | 25 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | basic.4y |
|---|---|
| 2nd row | high.school |
| 3rd row | high.school |
| 4th row | basic.6y |
| 5th row | high.school |
Common Values
| Value | Count | Frequency (%) |
| university.degree | 12164 | |
| high.school | 9512 | |
| basic.9y | 6045 | |
| professional.course | 5240 | |
| basic.4y | 4176 | 10.1% |
| basic.6y | 2291 | 5.6% |
| unknown | 1730 | 4.2% |
| illiterate | 18 | < 0.1% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| university.degree | 12164 | |
| high.school | 9512 | |
| basic.9y | 6045 | |
| professional.course | 5240 | |
| basic.4y | 4176 | 10.1% |
| basic.6y | 2291 | 5.6% |
| unknown | 1730 | 4.2% |
| illiterate | 18 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 59172 | 11.3% |
| i | 51628 | 9.9% |
| s | 49908 | 9.5% |
| . | 39428 | 7.5% |
| o | 36474 | 7.0% |
| r | 34826 | 6.7% |
| h | 28536 | 5.5% |
| c | 27264 | 5.2% |
| y | 24676 | 4.7% |
| n | 22594 | 4.3% |
| Other values (15) | 148860 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 471426 | |
| Other Punctuation | 39428 | 7.5% |
| Decimal Number | 12512 | 2.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 59172 | |
| i | 51628 | |
| s | 49908 | |
| o | 36474 | 7.7% |
| r | 34826 | 7.4% |
| h | 28536 | 6.1% |
| c | 27264 | 5.8% |
| y | 24676 | 5.2% |
| n | 22594 | 4.8% |
| g | 21676 | 4.6% |
| Other values (11) | 114672 |
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 6045 | |
| 4 | 4176 | |
| 6 | 2291 | 18.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 39428 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 471426 | |
| Common | 51940 | 9.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 59172 | |
| i | 51628 | |
| s | 49908 | |
| o | 36474 | 7.7% |
| r | 34826 | 7.4% |
| h | 28536 | 6.1% |
| c | 27264 | 5.8% |
| y | 24676 | 5.2% |
| n | 22594 | 4.8% |
| g | 21676 | 4.6% |
| Other values (11) | 114672 |
Common
| Value | Count | Frequency (%) |
| . | 39428 | |
| 9 | 6045 | 11.6% |
| 4 | 4176 | 8.0% |
| 6 | 2291 | 4.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 523366 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 59172 | 11.3% |
| i | 51628 | 9.9% |
| s | 49908 | 9.5% |
| . | 39428 | 7.5% |
| o | 36474 | 7.0% |
| r | 34826 | 6.7% |
| h | 28536 | 5.5% |
| c | 27264 | 5.2% |
| y | 24676 | 4.7% |
| n | 22594 | 4.3% |
| Other values (15) | 148860 |
default
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 321.8 KiB |
| no | |
|---|---|
| unknown | |
| yes | 3 |
Length
| Max length | 7 |
|---|---|
| Median length | 2 |
| Mean length | 3.043884787 |
| Min length | 2 |
Characters and Unicode
| Total characters | 125335 |
|---|---|
| Distinct characters | 8 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | no |
|---|---|
| 2nd row | unknown |
| 3rd row | no |
| 4th row | no |
| 5th row | no |
Common Values
| Value | Count | Frequency (%) |
| no | 32577 | |
| unknown | 8596 | 20.9% |
| yes | 3 | < 0.1% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| no | 32577 | |
| unknown | 8596 | 20.9% |
| yes | 3 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 58365 | |
| o | 41173 | |
| u | 8596 | 6.9% |
| k | 8596 | 6.9% |
| w | 8596 | 6.9% |
| y | 3 | < 0.1% |
| e | 3 | < 0.1% |
| s | 3 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 125335 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 58365 | |
| o | 41173 | |
| u | 8596 | 6.9% |
| k | 8596 | 6.9% |
| w | 8596 | 6.9% |
| y | 3 | < 0.1% |
| e | 3 | < 0.1% |
| s | 3 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 125335 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 58365 | |
| o | 41173 | |
| u | 8596 | 6.9% |
| k | 8596 | 6.9% |
| w | 8596 | 6.9% |
| y | 3 | < 0.1% |
| e | 3 | < 0.1% |
| s | 3 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 125335 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 58365 | |
| o | 41173 | |
| u | 8596 | 6.9% |
| k | 8596 | 6.9% |
| w | 8596 | 6.9% |
| y | 3 | < 0.1% |
| e | 3 | < 0.1% |
| s | 3 | < 0.1% |
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 321.8 KiB |
| yes | |
|---|---|
| no | |
| unknown | 990 |
Length
| Max length | 7 |
|---|---|
| Median length | 3 |
| Mean length | 2.64408879 |
| Min length | 2 |
Characters and Unicode
| Total characters | 108873 |
|---|---|
| Distinct characters | 8 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | no |
|---|---|
| 2nd row | no |
| 3rd row | yes |
| 4th row | no |
| 5th row | no |
Common Values
| Value | Count | Frequency (%) |
| yes | 21571 | |
| no | 18615 | |
| unknown | 990 | 2.4% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| yes | 21571 | |
| no | 18615 | |
| unknown | 990 | 2.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 21585 | |
| y | 21571 | |
| e | 21571 | |
| s | 21571 | |
| o | 19605 | |
| u | 990 | 0.9% |
| k | 990 | 0.9% |
| w | 990 | 0.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 108873 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 21585 | |
| y | 21571 | |
| e | 21571 | |
| s | 21571 | |
| o | 19605 | |
| u | 990 | 0.9% |
| k | 990 | 0.9% |
| w | 990 | 0.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 108873 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 21585 | |
| y | 21571 | |
| e | 21571 | |
| s | 21571 | |
| o | 19605 | |
| u | 990 | 0.9% |
| k | 990 | 0.9% |
| w | 990 | 0.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 108873 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 21585 | |
| y | 21571 | |
| e | 21571 | |
| s | 21571 | |
| o | 19605 | |
| u | 990 | 0.9% |
| k | 990 | 0.9% |
| w | 990 | 0.9% |
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 321.8 KiB |
| no | |
|---|---|
| yes | |
| unknown | 990 |
Length
| Max length | 7 |
|---|---|
| Median length | 2 |
| Mean length | 2.271954537 |
| Min length | 2 |
Characters and Unicode
| Total characters | 93550 |
|---|---|
| Distinct characters | 8 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | no |
|---|---|
| 2nd row | no |
| 3rd row | no |
| 4th row | no |
| 5th row | yes |
Common Values
| Value | Count | Frequency (%) |
| no | 33938 | |
| yes | 6248 | 15.2% |
| unknown | 990 | 2.4% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| no | 33938 | |
| yes | 6248 | 15.2% |
| unknown | 990 | 2.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 36908 | |
| o | 34928 | |
| y | 6248 | 6.7% |
| e | 6248 | 6.7% |
| s | 6248 | 6.7% |
| u | 990 | 1.1% |
| k | 990 | 1.1% |
| w | 990 | 1.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 93550 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 36908 | |
| o | 34928 | |
| y | 6248 | 6.7% |
| e | 6248 | 6.7% |
| s | 6248 | 6.7% |
| u | 990 | 1.1% |
| k | 990 | 1.1% |
| w | 990 | 1.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 93550 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 36908 | |
| o | 34928 | |
| y | 6248 | 6.7% |
| e | 6248 | 6.7% |
| s | 6248 | 6.7% |
| u | 990 | 1.1% |
| k | 990 | 1.1% |
| w | 990 | 1.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 93550 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 36908 | |
| o | 34928 | |
| y | 6248 | 6.7% |
| e | 6248 | 6.7% |
| s | 6248 | 6.7% |
| u | 990 | 1.1% |
| k | 990 | 1.1% |
| w | 990 | 1.1% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 321.8 KiB |
| cellular | |
|---|---|
| telephone |
Length
| Max length | 9 |
|---|---|
| Median length | 8 |
| Mean length | 8.365285603 |
| Min length | 8 |
Characters and Unicode
| Total characters | 344449 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | telephone |
|---|---|
| 2nd row | telephone |
| 3rd row | telephone |
| 4th row | telephone |
| 5th row | telephone |
Common Values
| Value | Count | Frequency (%) |
| cellular | 26135 | |
| telephone | 15041 |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| cellular | 26135 | |
| telephone | 15041 |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 93446 | |
| e | 71258 | |
| c | 26135 | 7.6% |
| u | 26135 | 7.6% |
| a | 26135 | 7.6% |
| r | 26135 | 7.6% |
| t | 15041 | 4.4% |
| p | 15041 | 4.4% |
| h | 15041 | 4.4% |
| o | 15041 | 4.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 344449 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 93446 | |
| e | 71258 | |
| c | 26135 | 7.6% |
| u | 26135 | 7.6% |
| a | 26135 | 7.6% |
| r | 26135 | 7.6% |
| t | 15041 | 4.4% |
| p | 15041 | 4.4% |
| h | 15041 | 4.4% |
| o | 15041 | 4.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 344449 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 93446 | |
| e | 71258 | |
| c | 26135 | 7.6% |
| u | 26135 | 7.6% |
| a | 26135 | 7.6% |
| r | 26135 | 7.6% |
| t | 15041 | 4.4% |
| p | 15041 | 4.4% |
| h | 15041 | 4.4% |
| o | 15041 | 4.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 344449 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| l | 93446 | |
| e | 71258 | |
| c | 26135 | 7.6% |
| u | 26135 | 7.6% |
| a | 26135 | 7.6% |
| r | 26135 | 7.6% |
| t | 15041 | 4.4% |
| p | 15041 | 4.4% |
| h | 15041 | 4.4% |
| o | 15041 | 4.4% |
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 321.8 KiB |
| may | |
|---|---|
| jul | |
| aug | |
| jun | |
| nov | |
| Other values (5) |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 123528 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | may |
|---|---|
| 2nd row | may |
| 3rd row | may |
| 4th row | may |
| 5th row | may |
Common Values
| Value | Count | Frequency (%) |
| may | 13767 | |
| jul | 7169 | |
| aug | 6176 | |
| jun | 5318 | 12.9% |
| nov | 4100 | 10.0% |
| apr | 2631 | 6.4% |
| oct | 717 | 1.7% |
| sep | 570 | 1.4% |
| mar | 546 | 1.3% |
| dec | 182 | 0.4% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| may | 13767 | |
| jul | 7169 | |
| aug | 6176 | |
| jun | 5318 | 12.9% |
| nov | 4100 | 10.0% |
| apr | 2631 | 6.4% |
| oct | 717 | 1.7% |
| sep | 570 | 1.4% |
| mar | 546 | 1.3% |
| dec | 182 | 0.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 23120 | |
| u | 18663 | |
| m | 14313 | |
| y | 13767 | |
| j | 12487 | |
| n | 9418 | |
| l | 7169 | 5.8% |
| g | 6176 | 5.0% |
| o | 4817 | 3.9% |
| v | 4100 | 3.3% |
| Other values (7) | 9498 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 123528 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 23120 | |
| u | 18663 | |
| m | 14313 | |
| y | 13767 | |
| j | 12487 | |
| n | 9418 | |
| l | 7169 | 5.8% |
| g | 6176 | 5.0% |
| o | 4817 | 3.9% |
| v | 4100 | 3.3% |
| Other values (7) | 9498 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 123528 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 23120 | |
| u | 18663 | |
| m | 14313 | |
| y | 13767 | |
| j | 12487 | |
| n | 9418 | |
| l | 7169 | 5.8% |
| g | 6176 | 5.0% |
| o | 4817 | 3.9% |
| v | 4100 | 3.3% |
| Other values (7) | 9498 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 123528 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 23120 | |
| u | 18663 | |
| m | 14313 | |
| y | 13767 | |
| j | 12487 | |
| n | 9418 | |
| l | 7169 | 5.8% |
| g | 6176 | 5.0% |
| o | 4817 | 3.9% |
| v | 4100 | 3.3% |
| Other values (7) | 9498 |
day_of_week
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 321.8 KiB |
| thu | |
|---|---|
| mon | |
| wed | |
| tue | |
| fri |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 123528 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | mon |
|---|---|
| 2nd row | mon |
| 3rd row | mon |
| 4th row | mon |
| 5th row | mon |
Common Values
| Value | Count | Frequency (%) |
| thu | 8618 | |
| mon | 8512 | |
| wed | 8134 | |
| tue | 8086 | |
| fri | 7826 |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| thu | 8618 | |
| mon | 8512 | |
| wed | 8134 | |
| tue | 8086 | |
| fri | 7826 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 16704 | |
| u | 16704 | |
| e | 16220 | |
| h | 8618 | |
| m | 8512 | |
| o | 8512 | |
| n | 8512 | |
| w | 8134 | |
| d | 8134 | |
| f | 7826 | |
| Other values (2) | 15652 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 123528 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 16704 | |
| u | 16704 | |
| e | 16220 | |
| h | 8618 | |
| m | 8512 | |
| o | 8512 | |
| n | 8512 | |
| w | 8134 | |
| d | 8134 | |
| f | 7826 | |
| Other values (2) | 15652 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 123528 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 16704 | |
| u | 16704 | |
| e | 16220 | |
| h | 8618 | |
| m | 8512 | |
| o | 8512 | |
| n | 8512 | |
| w | 8134 | |
| d | 8134 | |
| f | 7826 | |
| Other values (2) | 15652 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 123528 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 16704 | |
| u | 16704 | |
| e | 16220 | |
| h | 8618 | |
| m | 8512 | |
| o | 8512 | |
| n | 8512 | |
| w | 8134 | |
| d | 8134 | |
| f | 7826 | |
| Other values (2) | 15652 |
duration
Real number (ℝ≥0)
| Distinct | 1544 |
|---|---|
| Distinct (%) | 3.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 258.315815 |
| Minimum | 0 |
|---|---|
| Maximum | 4918 |
| Zeros | 4 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 321.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 36 |
| Q1 | 102 |
| median | 180 |
| Q3 | 319 |
| 95-th percentile | 753 |
| Maximum | 4918 |
| Range | 4918 |
| Interquartile range (IQR) | 217 |
Descriptive statistics
| Standard deviation | 259.305321 |
|---|---|
| Coefficient of variation (CV) | 1.003830605 |
| Kurtosis | 20.24377094 |
| Mean | 258.315815 |
| Median Absolute Deviation (MAD) | 94 |
| Skewness | 3.262807509 |
| Sum | 10636412 |
| Variance | 67239.24948 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 90 | 170 | 0.4% |
| 85 | 170 | 0.4% |
| 136 | 168 | 0.4% |
| 73 | 167 | 0.4% |
| 124 | 163 | 0.4% |
| 87 | 162 | 0.4% |
| 72 | 161 | 0.4% |
| 104 | 161 | 0.4% |
| 111 | 160 | 0.4% |
| 106 | 159 | 0.4% |
| Other values (1534) | 39535 |
| Value | Count | Frequency (%) |
| 0 | 4 | < 0.1% |
| 1 | 3 | < 0.1% |
| 2 | 1 | < 0.1% |
| 3 | 3 | < 0.1% |
| 4 | 12 | < 0.1% |
| 5 | 30 | 0.1% |
| 6 | 37 | |
| 7 | 54 | |
| 8 | 69 | |
| 9 | 77 |
| Value | Count | Frequency (%) |
| 4918 | 1 | |
| 4199 | 1 | |
| 3785 | 1 | |
| 3643 | 1 | |
| 3631 | 1 | |
| 3509 | 1 | |
| 3422 | 1 | |
| 3366 | 1 | |
| 3322 | 1 | |
| 3284 | 1 |
campaign
Real number (ℝ≥0)
| Distinct | 42 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.567879347 |
| Minimum | 1 |
|---|---|
| Maximum | 56 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 321.8 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 2 |
| Q3 | 3 |
| 95-th percentile | 7 |
| Maximum | 56 |
| Range | 55 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 2.770318336 |
|---|---|
| Coefficient of variation (CV) | 1.078835086 |
| Kurtosis | 36.9718574 |
| Mean | 2.567879347 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 4.762044061 |
| Sum | 105735 |
| Variance | 7.674663685 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 17634 | |
| 2 | 10568 | |
| 3 | 5340 | 13.0% |
| 4 | 2650 | 6.4% |
| 5 | 1599 | 3.9% |
| 6 | 979 | 2.4% |
| 7 | 629 | 1.5% |
| 8 | 400 | 1.0% |
| 9 | 283 | 0.7% |
| 10 | 225 | 0.5% |
| Other values (32) | 869 | 2.1% |
| Value | Count | Frequency (%) |
| 1 | 17634 | |
| 2 | 10568 | |
| 3 | 5340 | 13.0% |
| 4 | 2650 | 6.4% |
| 5 | 1599 | 3.9% |
| 6 | 979 | 2.4% |
| 7 | 629 | 1.5% |
| 8 | 400 | 1.0% |
| 9 | 283 | 0.7% |
| 10 | 225 | 0.5% |
| Value | Count | Frequency (%) |
| 56 | 1 | < 0.1% |
| 43 | 2 | < 0.1% |
| 42 | 2 | < 0.1% |
| 41 | 1 | < 0.1% |
| 40 | 2 | < 0.1% |
| 39 | 1 | < 0.1% |
| 37 | 1 | < 0.1% |
| 35 | 5 | |
| 34 | 3 | |
| 33 | 4 |
| Distinct | 27 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 962.4648096 |
| Minimum | 0 |
|---|---|
| Maximum | 999 |
| Zeros | 15 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 321.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 999 |
| Q1 | 999 |
| median | 999 |
| Q3 | 999 |
| 95-th percentile | 999 |
| Maximum | 999 |
| Range | 999 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 186.9371017 |
|---|---|
| Coefficient of variation (CV) | 0.1942274667 |
| Kurtosis | 22.22155279 |
| Mean | 962.4648096 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -4.921386382 |
| Sum | 39630451 |
| Variance | 34945.48 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 999 | 39661 | |
| 3 | 439 | 1.1% |
| 6 | 412 | 1.0% |
| 4 | 118 | 0.3% |
| 9 | 64 | 0.2% |
| 2 | 61 | 0.1% |
| 7 | 60 | 0.1% |
| 12 | 58 | 0.1% |
| 10 | 52 | 0.1% |
| 5 | 46 | 0.1% |
| Other values (17) | 205 | 0.5% |
| Value | Count | Frequency (%) |
| 0 | 15 | < 0.1% |
| 1 | 26 | 0.1% |
| 2 | 61 | 0.1% |
| 3 | 439 | |
| 4 | 118 | 0.3% |
| 5 | 46 | 0.1% |
| 6 | 412 | |
| 7 | 60 | 0.1% |
| 8 | 18 | < 0.1% |
| 9 | 64 | 0.2% |
| Value | Count | Frequency (%) |
| 999 | 39661 | |
| 27 | 1 | < 0.1% |
| 26 | 1 | < 0.1% |
| 25 | 1 | < 0.1% |
| 22 | 3 | < 0.1% |
| 21 | 2 | < 0.1% |
| 20 | 1 | < 0.1% |
| 19 | 3 | < 0.1% |
| 18 | 7 | < 0.1% |
| 17 | 8 | < 0.1% |
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.1730134059 |
| Minimum | 0 |
|---|---|
| Maximum | 7 |
| Zeros | 35551 |
| Zeros (%) | 86.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 321.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 7 |
| Range | 7 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.4949643814 |
|---|---|
| Coefficient of variation (CV) | 2.8608441 |
| Kurtosis | 20.10216376 |
| Mean | 0.1730134059 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.831395514 |
| Sum | 7124 |
| Variance | 0.2449897388 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 35551 | |
| 1 | 4561 | 11.1% |
| 2 | 754 | 1.8% |
| 3 | 216 | 0.5% |
| 4 | 70 | 0.2% |
| 5 | 18 | < 0.1% |
| 6 | 5 | < 0.1% |
| 7 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 35551 | |
| 1 | 4561 | 11.1% |
| 2 | 754 | 1.8% |
| 3 | 216 | 0.5% |
| 4 | 70 | 0.2% |
| 5 | 18 | < 0.1% |
| 6 | 5 | < 0.1% |
| 7 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 7 | 1 | < 0.1% |
| 6 | 5 | < 0.1% |
| 5 | 18 | < 0.1% |
| 4 | 70 | 0.2% |
| 3 | 216 | 0.5% |
| 2 | 754 | 1.8% |
| 1 | 4561 | 11.1% |
| 0 | 35551 |
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 321.8 KiB |
| nonexistent | |
|---|---|
| failure | |
| success | 1373 |
Length
| Max length | 11 |
|---|---|
| Median length | 11 |
| Mean length | 10.45356518 |
| Min length | 7 |
Characters and Unicode
| Total characters | 430436 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | nonexistent |
|---|---|
| 2nd row | nonexistent |
| 3rd row | nonexistent |
| 4th row | nonexistent |
| 5th row | nonexistent |
Common Values
| Value | Count | Frequency (%) |
| nonexistent | 35551 | |
| failure | 4252 | 10.3% |
| success | 1373 | 3.3% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| nonexistent | 35551 | |
| failure | 4252 | 10.3% |
| success | 1373 | 3.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 106653 | |
| e | 76727 | |
| t | 71102 | |
| i | 39803 | 9.2% |
| s | 39670 | 9.2% |
| o | 35551 | 8.3% |
| x | 35551 | 8.3% |
| u | 5625 | 1.3% |
| f | 4252 | 1.0% |
| a | 4252 | 1.0% |
| Other values (3) | 11250 | 2.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 430436 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 106653 | |
| e | 76727 | |
| t | 71102 | |
| i | 39803 | 9.2% |
| s | 39670 | 9.2% |
| o | 35551 | 8.3% |
| x | 35551 | 8.3% |
| u | 5625 | 1.3% |
| f | 4252 | 1.0% |
| a | 4252 | 1.0% |
| Other values (3) | 11250 | 2.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 430436 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 106653 | |
| e | 76727 | |
| t | 71102 | |
| i | 39803 | 9.2% |
| s | 39670 | 9.2% |
| o | 35551 | 8.3% |
| x | 35551 | 8.3% |
| u | 5625 | 1.3% |
| f | 4252 | 1.0% |
| a | 4252 | 1.0% |
| Other values (3) | 11250 | 2.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 430436 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 106653 | |
| e | 76727 | |
| t | 71102 | |
| i | 39803 | 9.2% |
| s | 39670 | 9.2% |
| o | 35551 | 8.3% |
| x | 35551 | 8.3% |
| u | 5625 | 1.3% |
| f | 4252 | 1.0% |
| a | 4252 | 1.0% |
| Other values (3) | 11250 | 2.6% |
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.08192150767 |
| Minimum | -3.4 |
|---|---|
| Maximum | 1.4 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 17186 |
| Negative (%) | 41.7% |
| Memory size | 321.8 KiB |
Quantile statistics
| Minimum | -3.4 |
|---|---|
| 5-th percentile | -2.9 |
| Q1 | -1.8 |
| median | 1.1 |
| Q3 | 1.4 |
| 95-th percentile | 1.4 |
| Maximum | 1.4 |
| Range | 4.8 |
| Interquartile range (IQR) | 3.2 |
Descriptive statistics
| Standard deviation | 1.570882615 |
|---|---|
| Coefficient of variation (CV) | 19.17546026 |
| Kurtosis | -1.062698024 |
| Mean | 0.08192150767 |
| Median Absolute Deviation (MAD) | 0.3 |
| Skewness | -0.7240605917 |
| Sum | 3373.2 |
| Variance | 2.467672189 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1.4 | 16228 | |
| -1.8 | 9182 | |
| 1.1 | 7762 | |
| -0.1 | 3682 | 8.9% |
| -2.9 | 1662 | 4.0% |
| -3.4 | 1070 | 2.6% |
| -1.7 | 773 | 1.9% |
| -1.1 | 635 | 1.5% |
| -3 | 172 | 0.4% |
| -0.2 | 10 | < 0.1% |
| Value | Count | Frequency (%) |
| -3.4 | 1070 | 2.6% |
| -3 | 172 | 0.4% |
| -2.9 | 1662 | 4.0% |
| -1.8 | 9182 | |
| -1.7 | 773 | 1.9% |
| -1.1 | 635 | 1.5% |
| -0.2 | 10 | < 0.1% |
| -0.1 | 3682 | 8.9% |
| 1.1 | 7762 | |
| 1.4 | 16228 |
| Value | Count | Frequency (%) |
| 1.4 | 16228 | |
| 1.1 | 7762 | |
| -0.1 | 3682 | 8.9% |
| -0.2 | 10 | < 0.1% |
| -1.1 | 635 | 1.5% |
| -1.7 | 773 | 1.9% |
| -1.8 | 9182 | |
| -2.9 | 1662 | 4.0% |
| -3 | 172 | 0.4% |
| -3.4 | 1070 | 2.6% |
cons.price.idx
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATION| Distinct | 26 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 93.57571989 |
| Minimum | 92.201 |
|---|---|
| Maximum | 94.767 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 321.8 KiB |
Quantile statistics
| Minimum | 92.201 |
|---|---|
| 5-th percentile | 92.713 |
| Q1 | 93.075 |
| median | 93.749 |
| Q3 | 93.994 |
| 95-th percentile | 94.465 |
| Maximum | 94.767 |
| Range | 2.566 |
| Interquartile range (IQR) | 0.919 |
Descriptive statistics
| Standard deviation | 0.5788389856 |
|---|---|
| Coefficient of variation (CV) | 0.006185781806 |
| Kurtosis | -0.8298510691 |
| Mean | 93.57571989 |
| Median Absolute Deviation (MAD) | 0.38 |
| Skewness | -0.2308529068 |
| Sum | 3853073.842 |
| Variance | 0.3350545712 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 93.994 | 7762 | |
| 93.918 | 6681 | |
| 92.893 | 5793 | |
| 93.444 | 5173 | |
| 94.465 | 4374 | |
| 93.2 | 3615 | |
| 93.075 | 2457 | 6.0% |
| 92.201 | 770 | 1.9% |
| 92.963 | 715 | 1.7% |
| 92.431 | 446 | 1.1% |
| Other values (16) | 3390 |
| Value | Count | Frequency (%) |
| 92.201 | 770 | 1.9% |
| 92.379 | 267 | 0.6% |
| 92.431 | 446 | 1.1% |
| 92.469 | 177 | 0.4% |
| 92.649 | 357 | 0.9% |
| 92.713 | 172 | 0.4% |
| 92.756 | 10 | < 0.1% |
| 92.843 | 282 | 0.7% |
| 92.893 | 5793 | |
| 92.963 | 715 | 1.7% |
| Value | Count | Frequency (%) |
| 94.767 | 128 | 0.3% |
| 94.601 | 204 | 0.5% |
| 94.465 | 4374 | |
| 94.215 | 311 | 0.8% |
| 94.199 | 303 | 0.7% |
| 94.055 | 229 | 0.6% |
| 94.027 | 233 | 0.6% |
| 93.994 | 7762 | |
| 93.918 | 6681 | |
| 93.876 | 212 | 0.5% |
| Distinct | 26 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -40.50286332 |
| Minimum | -50.8 |
|---|---|
| Maximum | -26.9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 41176 |
| Negative (%) | 100.0% |
| Memory size | 321.8 KiB |
Quantile statistics
| Minimum | -50.8 |
|---|---|
| 5-th percentile | -47.1 |
| Q1 | -42.7 |
| median | -41.8 |
| Q3 | -36.4 |
| 95-th percentile | -33.6 |
| Maximum | -26.9 |
| Range | 23.9 |
| Interquartile range (IQR) | 6.3 |
Descriptive statistics
| Standard deviation | 4.627859965 |
|---|---|
| Coefficient of variation (CV) | -0.1142600692 |
| Kurtosis | -0.3590970525 |
| Mean | -40.50286332 |
| Median Absolute Deviation (MAD) | 4.4 |
| Skewness | 0.3028760001 |
| Sum | -1667745.9 |
| Variance | 21.41708785 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -36.4 | 7762 | |
| -42.7 | 6681 | |
| -46.2 | 5793 | |
| -36.1 | 5173 | |
| -41.8 | 4374 | |
| -42 | 3615 | |
| -47.1 | 2457 | 6.0% |
| -31.4 | 770 | 1.9% |
| -40.8 | 715 | 1.7% |
| -26.9 | 446 | 1.1% |
| Other values (16) | 3390 |
| Value | Count | Frequency (%) |
| -50.8 | 128 | 0.3% |
| -50 | 282 | 0.7% |
| -49.5 | 204 | 0.5% |
| -47.1 | 2457 | 6.0% |
| -46.2 | 5793 | |
| -45.9 | 10 | < 0.1% |
| -42.7 | 6681 | |
| -42 | 3615 | |
| -41.8 | 4374 | |
| -40.8 | 715 | 1.7% |
| Value | Count | Frequency (%) |
| -26.9 | 446 | 1.1% |
| -29.8 | 267 | 0.6% |
| -30.1 | 357 | 0.9% |
| -31.4 | 770 | 1.9% |
| -33 | 172 | 0.4% |
| -33.6 | 177 | 0.4% |
| -34.6 | 174 | 0.4% |
| -34.8 | 264 | 0.6% |
| -36.1 | 5173 | |
| -36.4 | 7762 |
| Distinct | 316 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.621293448 |
| Minimum | 0.634 |
|---|---|
| Maximum | 5.045 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 321.8 KiB |
Quantile statistics
| Minimum | 0.634 |
|---|---|
| 5-th percentile | 0.797 |
| Q1 | 1.344 |
| median | 4.857 |
| Q3 | 4.961 |
| 95-th percentile | 4.966 |
| Maximum | 5.045 |
| Range | 4.411 |
| Interquartile range (IQR) | 3.617 |
Descriptive statistics
| Standard deviation | 1.734437004 |
|---|---|
| Coefficient of variation (CV) | 0.4789551106 |
| Kurtosis | -1.40679132 |
| Mean | 3.621293448 |
| Median Absolute Deviation (MAD) | 0.108 |
| Skewness | -0.7091942126 |
| Sum | 149110.379 |
| Variance | 3.00827172 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4.857 | 2868 | 7.0% |
| 4.962 | 2611 | 6.3% |
| 4.963 | 2487 | 6.0% |
| 4.961 | 1902 | 4.6% |
| 4.856 | 1210 | 2.9% |
| 4.964 | 1175 | 2.9% |
| 1.405 | 1169 | 2.8% |
| 4.965 | 1070 | 2.6% |
| 4.864 | 1044 | 2.5% |
| 4.96 | 1013 | 2.5% |
| Other values (306) | 24627 |
| Value | Count | Frequency (%) |
| 0.634 | 8 | < 0.1% |
| 0.635 | 43 | |
| 0.636 | 14 | < 0.1% |
| 0.637 | 6 | < 0.1% |
| 0.638 | 7 | < 0.1% |
| 0.639 | 16 | < 0.1% |
| 0.64 | 10 | < 0.1% |
| 0.642 | 35 | |
| 0.643 | 23 | |
| 0.644 | 38 |
| Value | Count | Frequency (%) |
| 5.045 | 9 | < 0.1% |
| 5 | 7 | < 0.1% |
| 4.97 | 172 | 0.4% |
| 4.968 | 991 | 2.4% |
| 4.967 | 643 | 1.6% |
| 4.966 | 620 | 1.5% |
| 4.965 | 1070 | |
| 4.964 | 1175 | |
| 4.963 | 2487 | |
| 4.962 | 2611 |
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5167.03487 |
| Minimum | 4963.6 |
|---|---|
| Maximum | 5228.1 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 321.8 KiB |
Quantile statistics
| Minimum | 4963.6 |
|---|---|
| 5-th percentile | 5017.5 |
| Q1 | 5099.1 |
| median | 5191 |
| Q3 | 5228.1 |
| 95-th percentile | 5228.1 |
| Maximum | 5228.1 |
| Range | 264.5 |
| Interquartile range (IQR) | 129 |
Descriptive statistics
| Standard deviation | 72.25136397 |
|---|---|
| Coefficient of variation (CV) | 0.01398313845 |
| Kurtosis | -0.003539670085 |
| Mean | 5167.03487 |
| Median Absolute Deviation (MAD) | 37.1 |
| Skewness | -1.044317057 |
| Sum | 212757827.8 |
| Variance | 5220.259596 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5228.1 | 16228 | |
| 5099.1 | 8532 | |
| 5191 | 7762 | |
| 5195.8 | 3682 | 8.9% |
| 5076.2 | 1662 | 4.0% |
| 5017.5 | 1070 | 2.6% |
| 4991.6 | 773 | 1.9% |
| 5008.7 | 650 | 1.6% |
| 4963.6 | 635 | 1.5% |
| 5023.5 | 172 | 0.4% |
| Value | Count | Frequency (%) |
| 4963.6 | 635 | 1.5% |
| 4991.6 | 773 | 1.9% |
| 5008.7 | 650 | 1.6% |
| 5017.5 | 1070 | 2.6% |
| 5023.5 | 172 | 0.4% |
| 5076.2 | 1662 | 4.0% |
| 5099.1 | 8532 | |
| 5176.3 | 10 | < 0.1% |
| 5191 | 7762 | |
| 5195.8 | 3682 |
| Value | Count | Frequency (%) |
| 5228.1 | 16228 | |
| 5195.8 | 3682 | 8.9% |
| 5191 | 7762 | |
| 5176.3 | 10 | < 0.1% |
| 5099.1 | 8532 | |
| 5076.2 | 1662 | 4.0% |
| 5023.5 | 172 | 0.4% |
| 5017.5 | 1070 | 2.6% |
| 5008.7 | 650 | 1.6% |
| 4991.6 | 773 | 1.9% |
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.First rows
| df_index | age | job | marital | education | default | housing | loan | contact | month | day_of_week | duration | campaign | pdays | previous | poutcome | emp.var.rate | cons.price.idx | cons.conf.idx | euribor3m | nr.employed | output | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 56 | housemaid | married | basic.4y | no | no | no | telephone | may | mon | 261 | 1 | 999 | 0 | nonexistent | 1.1 | 93.994 | -36.4 | 4.857 | 5191.0 | no |
| 1 | 1 | 57 | services | married | high.school | unknown | no | no | telephone | may | mon | 149 | 1 | 999 | 0 | nonexistent | 1.1 | 93.994 | -36.4 | 4.857 | 5191.0 | no |
| 2 | 2 | 37 | services | married | high.school | no | yes | no | telephone | may | mon | 226 | 1 | 999 | 0 | nonexistent | 1.1 | 93.994 | -36.4 | 4.857 | 5191.0 | no |
| 3 | 3 | 40 | admin | married | basic.6y | no | no | no | telephone | may | mon | 151 | 1 | 999 | 0 | nonexistent | 1.1 | 93.994 | -36.4 | 4.857 | 5191.0 | no |
| 4 | 4 | 56 | services | married | high.school | no | no | yes | telephone | may | mon | 307 | 1 | 999 | 0 | nonexistent | 1.1 | 93.994 | -36.4 | 4.857 | 5191.0 | no |
| 5 | 5 | 45 | services | married | basic.9y | unknown | no | no | telephone | may | mon | 198 | 1 | 999 | 0 | nonexistent | 1.1 | 93.994 | -36.4 | 4.857 | 5191.0 | no |
| 6 | 6 | 59 | admin | married | professional.course | no | no | no | telephone | may | mon | 139 | 1 | 999 | 0 | nonexistent | 1.1 | 93.994 | -36.4 | 4.857 | 5191.0 | no |
| 7 | 7 | 41 | blue-collar | married | unknown | unknown | no | no | telephone | may | mon | 217 | 1 | 999 | 0 | nonexistent | 1.1 | 93.994 | -36.4 | 4.857 | 5191.0 | no |
| 8 | 8 | 24 | technician | single | professional.course | no | yes | no | telephone | may | mon | 380 | 1 | 999 | 0 | nonexistent | 1.1 | 93.994 | -36.4 | 4.857 | 5191.0 | no |
| 9 | 9 | 25 | services | single | high.school | no | yes | no | telephone | may | mon | 50 | 1 | 999 | 0 | nonexistent | 1.1 | 93.994 | -36.4 | 4.857 | 5191.0 | no |
Last rows
| df_index | age | job | marital | education | default | housing | loan | contact | month | day_of_week | duration | campaign | pdays | previous | poutcome | emp.var.rate | cons.price.idx | cons.conf.idx | euribor3m | nr.employed | output | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 41166 | 41178 | 62 | retired | married | university.degree | no | no | no | cellular | nov | thu | 483 | 2 | 6 | 3 | success | -1.1 | 94.767 | -50.8 | 1.031 | 4963.6 | yes |
| 41167 | 41179 | 64 | retired | divorced | professional.course | no | yes | no | cellular | nov | fri | 151 | 3 | 999 | 0 | nonexistent | -1.1 | 94.767 | -50.8 | 1.028 | 4963.6 | no |
| 41168 | 41180 | 36 | admin | married | university.degree | no | no | no | cellular | nov | fri | 254 | 2 | 999 | 0 | nonexistent | -1.1 | 94.767 | -50.8 | 1.028 | 4963.6 | no |
| 41169 | 41181 | 37 | admin | married | university.degree | no | yes | no | cellular | nov | fri | 281 | 1 | 999 | 0 | nonexistent | -1.1 | 94.767 | -50.8 | 1.028 | 4963.6 | yes |
| 41170 | 41182 | 29 | unemployed | single | basic.4y | no | yes | no | cellular | nov | fri | 112 | 1 | 9 | 1 | success | -1.1 | 94.767 | -50.8 | 1.028 | 4963.6 | no |
| 41171 | 41183 | 73 | retired | married | professional.course | no | yes | no | cellular | nov | fri | 334 | 1 | 999 | 0 | nonexistent | -1.1 | 94.767 | -50.8 | 1.028 | 4963.6 | yes |
| 41172 | 41184 | 46 | blue-collar | married | professional.course | no | no | no | cellular | nov | fri | 383 | 1 | 999 | 0 | nonexistent | -1.1 | 94.767 | -50.8 | 1.028 | 4963.6 | no |
| 41173 | 41185 | 56 | retired | married | university.degree | no | yes | no | cellular | nov | fri | 189 | 2 | 999 | 0 | nonexistent | -1.1 | 94.767 | -50.8 | 1.028 | 4963.6 | no |
| 41174 | 41186 | 44 | technician | married | professional.course | no | no | no | cellular | nov | fri | 442 | 1 | 999 | 0 | nonexistent | -1.1 | 94.767 | -50.8 | 1.028 | 4963.6 | yes |
| 41175 | 41187 | 74 | retired | married | professional.course | no | yes | no | cellular | nov | fri | 239 | 3 | 999 | 1 | failure | -1.1 | 94.767 | -50.8 | 1.028 | 4963.6 | no |